perm filename INTRO[0,BGB]6 blob sn#102638 filedate 1974-05-22 generic text, type C, neo UTF8
COMMENT āŠ—   VALID 00006 PAGES
C REC  PAGE   DESCRIPTION
C00001 00001
C00002 00002	TITLE PAGE - 1.						   JUNE 1974.
C00005 00003		CONTENTS:
C00006 00004	TITLE PAGE - 2.						   JUNE 1974.
C00009 00005	0.0	INTRODUCTION.
C00012 00006	
C00017 ENDMK
CāŠ—;
TITLE PAGE - 1.						   JUNE 1974.

draft - draft - draft - draft - draft - draft - draft - draft - draft



               GEOMETRIC MODELING FOR COMPUTER VISION.


                          BRUCE G. BAUMGART


ABSTRACT:

	The main idea of this thesis is that a 3-D geometric model of
the physical  world is an essential part  of a general purpose vision
system.  Such a model provides a goal for descriptive image analysis,
an origin for image synthesis (for  verification),  and a context for
spatial  problem solving.  Some of the  design ideas  to be presented
have been  implemented  in two  programs named  GEOMED  and CRE;  the
programs  are demonstrated  in the  context of  viewing objects  on a
turntable.



---------------------------------------------------------------------
This  research  was  supported  in  part  by  the  Advanced  Research
Projects  Agency of  the  Office of  the Secretary  of  Defense under
Contract No. SD-183.

The views and  conclusions contained  in this document  are those  of
the author and should not  be interpreted as necessarily representing
the  official policies, either expressed or  implied, of the Advanced
Research Project Agency or the United States Government.
	CONTENTS:

{INTRO}		0.	INTRODUCTION.
{GEM}		1.	GEOMETRIC MODELING THEORY.
{WINGED}	2.	THE WINGED EDGE POLYHEDRON REPRESENTATION.
{GEOMED}	3.	GEOMED AS A GEOMETRIC MODELING COMMAND LANGUAGE.
{BIN}		4.	A POLYHEDRON INTERSECTION ALGORITHM.
{OCCULT}	5.	HIDDEN LINE ELIMINATION FOR COMPUTER VISION.
{CNTOUR}	6.	VIDEO IMAGE CONTOURING.
{CMPARE}	7.	IMAGE COMPARING.
{CAMERA}	8.	CAMERA SOLVING.
{VIS}		9.	COMPUTER VISION THEORY.
{CONCLU}	10.	CONCLUSION.

	APPENDICES:

{REF}			REFERENCES.
{GNODES}		GEOMED NODE FORMATS.
{CNODES}		CRE NOOE FORMATS.
TITLE PAGE - 2.						   JUNE 1974.

               GEOMETRIC MODELING FOR COMPUTER VISION.

---------------------------------------------------------------------
                           A DISSERTATION

           SUBMITTED TO THE DEPARTMENT OF COMPUTER SCIENCE

                AND THE COMMITTEE ON GRADUATE STUDIES

                       OF STANFORD UNIVERSITY

             IN PARTIAL FULFILLMENT OF THE REQUIREMENTS

                          FOR THE DEGREE OF

                        DOCTOR OF PHILOSOPHY

---------------------------------------------------------------------
                                 BY
                          BRUCE G. BAUMGART
                             JUNE  1974
---------------------------------------------------------------------
DETAILED TABLE OF CONTENTS.
LIST OF BOXES.
LIST OF FIGURES.
---------------------------------------------------------------------
ACKNOWLEDGEMENTS

Thesis Adviser:		John Mc Carthy
Readers:		Jerome A. Feldman
			Donald E. Knuth
			Alan C. Kay
People:
	Jerry	Agin,
	Leona 	Baumgart,
	Tom 	Binford, 
	Jack 	Buchanan,
	Les 	Earnest,
	Tom 	Gafford,
	Steve 	Gibson,
	Ralph 	Gorin,
	Tovar 	Mock,
	Andy 	Moorer,
	Hans 	Moravec,
	Richard	Orban,
	Ted 	Panofsky,
	Lou 	Paul,
	Lynn 	Quam,
	Jeff 	Raskin,
	Ron 	Rivest,
	Irwin	Sobel,
	Robert 	Sproull,
	Ivan 	Sutherland,
	Dan 	Swinehart,
	Russel 	Taylor,
	Marty	Tenenbaum,
	Arthur 	Thomas,
0.0	INTRODUCTION.
	
	"For  the purpose  of  presenting my  argument  I must  first
explain the basic  premise of sorcery as don Juan presented it to me.
He said that for a sorcerer, the world of everyday life is  not real,
or out  there, as we  believe it is. For  a sorcerer, reality  or the
world we  all know, is only a description. For the sake of validating
this premise  don  Juan concentrated  the best  of  his efforts  into
leading me  to a genuine conviction  that what I held in  mind as the
world at hand was  merely a description of  the world; a  description
that had been pounded into me from the moment I was born."

				- Carlos Castaneda. Journey to Ixtlan.


	This thesis  is about  computer techniques  for handling  3-D
geometric descriptions  of the world; the world  that can be visually
perceived with a television camera.   The overall design idea may  be
characterized as  an inverse  computer graphics approach  to computer
vision. In  computer graphics, the world is represented in sufficient
detail so that the image forming process can be numerically simulated
to  generate synthetic television  images; in the  inverse, perceived
television pictures (from a real  TV camera) are analysed to  compute
detailed geometric models. For example,  the  polyhedron in figure **
was automatically  computed from eight views of  a plastic horse on a
turntable. It is hoped,  that visually acquired 3-D  geometric models
can  be of  use  to other  robotic  processes  such as  manipulation,
navigation or recognition.


	Once acquired,   a 3-D  model can be  used to  anticipate the
appearance  of an  object or  a  scene of  objects making  feasible a
quantitative form of vision  by verification (feedback vision).   For
example,   the predicted  video appearance of  the two  machine parts
depicted  in figure ** can be computed,  figure **, and compared with
an  actual perceived  video  image,  figure  **.   By  comparing  the
predicted  image with a  perceived image, the  correspondence between
features of the internal model  and features of the external  reality
can be established (figure **), the precise location of the parts and
the camera  can be measured, and new  phenomena can be detected, such
as the little black cube in the lower left of the perceived image.

	The  following  chapters  proceed   from  theory,     through
implementation,   and back  to theory; with  the first  five chapters
dealing with modeling and the last five chapters dealing with vision.
The theory consists of two essays: the first,   on geometric modeling
in  chapter one  and  the second  on  vision in  chapter  nine.   The
implementation consists of two programs named GEOMED and CRE. CRE  is
a solution to the problem of finding intensity contours in a sequence
of television  pictures and of linking corresponding contours between
pictures.  GEOMED  is a system  of 3-D  modeling routines with  which
arbitrary  polyhedra may  be constructed,    altered,   or viewed  in
perspective with  hidden lines eliminated. It is a sequence of GEOMED
operations  that  generates  new  object  descriptions  from  contour
images.